Constrained Reinforcement Learning for Vehicle Motion Planning with Topological Reachability Analysis
نویسندگان
چکیده
Rule-based traditional motion planning methods usually perform well with prior knowledge of the macro-scale environments but encounter challenges in unknown and uncertain environments. Deep reinforcement learning (DRL) is a solution that can effectively deal micro-scale Nevertheless, DRL unstable lacks interpretability. Therefore, it raises new challenge: how to combine effectiveness overcome drawbacks two while guaranteeing stability In this study, multi-constraint multi-scale method proposed for automated driving use constrained (RL), named RLTT, comprising RL, topological reachability analysis used vehicle path space (TPS), trajectory lane model (TLM). First, dynamic vehicles formulated; then, TLM developed on basis model, thus constraining RL action state space. Second, achieved through TPS, range, discrete routing points are via RLTT. Third, designed by combining sophisticated rules, theoretical provided guarantee efficiency our method. Finally, related experiments conducted evaluate method; reduce 19.9% distance cost as compared Experimental results indicate help mitigate gap between data-driven methods, provide better performance driving, facilitate more fields.
منابع مشابه
Motion Planning for the ATHLETE Rover with Reinforcement Learning
Legged locomotion is attractive because it can enable a robot to traverse far more varied terrain than a wheeled rover is capable of. In the context of planetary exploration, this is especially attractive as the sites of greatest scientific interest tend to be characterized by difficult terrain. For example, it would be extremely difficult for a wheeled rover to make its way into a lunar crater...
متن کاملRobust Reinforcement Learning in Motion Planning
While exploring to find better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, or even catastrophic, results, often modeled in terms of reaching 'failure' states of the agent's environment. This paper presents a method that uses domain knowledge to reduce the number of failures during explorati...
متن کاملA Reinforcement Learning Approach for Motion Planning of Hopping Rovers
This paper considers the problem of navigating a hopping rover on the surface of small Solar System bodies, such as asteroids and comets. Specifically, by controlling the torques applied to three internal flywheels, the rover is able to perform controllable long-range hops and short tumbling maneuvers. The highly stochastic dynamics of bouncing in microgravity and the sequential and discrete na...
متن کاملCuriosity driven reinforcement learning for motion planning on humanoids
Most previous work on artificial curiosity (AC) and intrinsic motivation focuses on basic concepts and theory. Experimental results are generally limited to toy scenarios, such as navigation in a simulated maze, or control of a simple mechanical system with one or two degrees of freedom. To study AC in a more realistic setting, we embody a curious agent in the complex iCub humanoid robot. Our n...
متن کاملSafety-Constrained Reinforcement Learning for MDPs
We consider controller synthesis for stochastic and partially unknown environments in which safety is essential. Specifically, we abstract the problem as a Markov decision process in which the expected performance is measured using a cost function that is unknown prior to run-time exploration of the state space. Standard learning approaches synthesize cost-optimal strategies without guaranteein...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Robotics
سال: 2022
ISSN: ['2218-6581']
DOI: https://doi.org/10.3390/robotics11040081